A comparative study of constrained and unconstrained approaches for segmentation of speech signal

Authors

  • Venkatesh Keri
  • Kishore Prahallad
Abstract

In this work, we compare several approaches to speech segmentation, some constrained by a phone transcript and others unconstrained. Transcript-constrained approaches such as HMM forced alignment can yield highly accurate segmentation when the exact phone transcript is known. Because an exact phone transcript is hard to obtain, however, such approaches must in practice make do with a canonical phone transcript. Our experiments on the TIMIT corpus show that unconstrained approaches based on ANNs and on an HMM phone loop perform better than HMM forced alignment constrained by the canonical phone transcript. Finally, we report a detailed error analysis of these approaches.
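
Comparisons of this kind are often scored by counting how many reference phone boundaries have a hypothesized boundary within a small time tolerance. The abstract does not state the metric used, so the following is only a minimal sketch under that assumption; the function name `boundary_agreement`, the 20 ms default tolerance, and the sample boundary times are all hypothetical.

```python
from bisect import bisect_left


def boundary_agreement(reference_ms, hypothesis_ms, tolerance_ms=20):
    """Fraction of reference boundaries that have at least one hypothesized
    boundary within tolerance_ms. Times are integers in milliseconds.

    Assumption: this is a simple nearest-boundary check, not a one-to-one
    matching, and not necessarily the metric used in the paper.
    """
    if not reference_ms:
        return 0.0
    hyp = sorted(hypothesis_ms)
    hits = 0
    for t in reference_ms:
        # The nearest hypothesized boundary is either just before or
        # just after the insertion point of t in the sorted list.
        i = bisect_left(hyp, t)
        neighbours = hyp[max(i - 1, 0):i + 1]
        if any(abs(t - h) <= tolerance_ms for h in neighbours):
            hits += 1
    return hits / len(reference_ms)


# Hypothetical boundary times (ms) for one utterance.
reference = [100, 250, 400, 620]
hypothesis = [105, 290, 395, 600, 700]
score = boundary_agreement(reference, hypothesis)
```

Because the score is computed from boundary times alone, the same function could be applied to output from either a transcript-constrained or an unconstrained segmenter.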

Similar resources

A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement

A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

A Time-Frequency approach for EEG signal segmentation

The record of human brain neural activity, the electroencephalogram (EEG), is generally known to be a non-stationary and nonlinear signal. In many applications, it is useful to divide the EEG into segments within which the signal can be considered stationary. The combination of empirical mode decomposition (EMD) and the Hilbert transform, called the Hilbert-Huang transform (HHT), is a new and powerful ...

Comparative Performance Study of Tuned Liquid Column Ball Damper for Excessive Liquid Displacement on Response Reduction of Structure

The tuned liquid column damper (TLCD), a U-shaped tube of uniform cross-section filled with liquid, is used as a vibration response mitigation device. The tuned liquid column ball damper (TLCBD) is a modified TLCD in which the immovable orifice positioned at the middle of the horizontal portion is replaced by a metal ball. Different studies on the unconstrained optimization per...

An Adaptive Segmentation Method Using Fractal Dimension and Wavelet Transform

In analyzing a signal, especially a non-stationary signal, it is often necessary to segment the signal into small epochs. Segmentation can be performed by splitting the signal at the time instants where its amplitude or frequency changes. In this paper, the signal is initially decomposed into signals with different frequency bands using the wavelet transform. Then, fractal dimension of ...


Publication date: 2010